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Introduction 

Human  LINE-1  (LlHs)  is  a  transposable  element  that  encodes  a  reverse  transcriptase  and 
moves  via  an  RNA  intermediate  [1].  It  therefore  seems  possible  that  cells  in  which  LlHs  is 
active  may  be  subject  to  insertional  mutagenesis.  We  have  recently  found  that  the  element  is 
expressed  in  a  significant  number  of  germ  cell  cancers  [2,3],  as  well  as  in  many  breast 
tumors  and  breast  tumor  cell  lines  [4] .  This  last  finding  raises  the  possibility  that  the 
initiation  or  progression  of  some  breast  cancers  is  facilitated  by  LlHs-mduced  insertional 
mutations. 

In  addition  to  insertional  mutagenesis,  there  are  several  other  characteristics  of  the  LlHs 
element  that  suggest  its  potential  as  an  oncogenic  agent.  For  example,  LlHs  has  an  internal 
promoter  which  could  potentially  lead  to  readthrough  transcription  and  activation  of 
downstream  genes.  In  addition,  the  LlHs-encoded  p40  protein  has  a  leucine  zipper  motif, 
suggesting  a  possible  interaction  with  other  cellular  proteins.  Such  interactions  at 
inappropriate  times  might  lead  to  the  disruption  of  important  cellular  functions.  Thus,  LlHs 
involvement  in  cancer  could  occur  by  several  mechanisms,  either  singly  or  in  combination. 

Our  long  range  goals  focus  on  the  identification  and  characterization  of  LlHs  elements  that 
are  expressed  in  breast  cancers.  LlHs  expression  may  identify  a  class  of  breast  neoplasms  in 
which  LlHs-encoded  proteins  initiate  or  maintain  the  neoplastic  state.  This  suggests  that 
down-regulating  LlHs  expression  could  influence  the  course  of  breast  cancer.  To  approach 
this  problem  we  have  proposed  to  study  the  expression  of  LlHs  in  cell  lines  and  in  solid 
tumors  by  cloning  and  characterizing  actively  expressed  LlHs  elements.  Specifically  we  have 
proposed  to: 

(1)  Clone  active  LlHs  elements  from  several  breast  cancer  cell  lines. 

(2)  Characterize  the  active  LlHs  sequences  and  determine  if  the  same  LlHs  elements  are 
active  in  solid  tumors. 

Experimental 

Isolation  of  hypomethylated  LlHs  elements. 

Our  original  plan  was  to  isolate  unmethylated  (potentially  active)  LlHs  elements  by  the 
method  of  inverse  polymerase  chain  reaction  (PCR)  [5],  and  this  approach  has  worked  quite 
well.  Our  current  protocol  runs  as  follows:  we  partially  cleave  DNA  extracted  from  the 
T47D  breast  tumor  line  with  Hpall,  an  enzyme  that  appears  to  cleave  LlHs  sequences  only 
in  transcriptionally  active  cells  [6].  We  then  isolate  fragments  from  0.3- 1.3  kb  in  size  on 
acrylamide  gels  and  circularize  the  fragments  by  incubation  with  T4  DNA  ligase  under 
conditions  of  dilute  DNA  concentration  (1-2  ng/jul).  We  collect  the  circularized  DNAs  and 
cleave  with  the  restriction  enzyme  BgH  (BgB  has  a  consensus  site  located  at  position  10-20 
on  the  LlHs  consensus  sequence;  the  enzyme  introduces  cuts  between  nucleotides  13/14  and 
16/17  giving  3’  overhangs)  and  perform  PCR  under  relaxed  conditions  with  primers  that  are 
complementary  to  regions  3-16  and  14-27  on  the  LlHs  consensus  sequence.  This  results  in 
PCR  products  that  contain  5’  flanking  DNA  bracketed  by  LlHs  sequences.  In  most  cases  the 
LlHs  moiety  of  the  clone  stretches  from  bp  1  to  a  Hpall  site  at  bp  36. 

Following  inverse-PCR  and  cloning  of  the  products  into  pBluescript  KS  or  pDKlOl  [7], 
we  isolated  12  unique  clones  deemed  suitable  for  analysis  (Tables  I  and  II,  pp.  10-12).  We 
use  the  following  nomenclamre  when  describing  the  cloned  sequences:  when  discussing  the 
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clone,  without  regard  to  the  LlHs  or  the  flanker  moiety,  no  additional  designation  is  given 
{e.g. ,  clone  593B).  When  referring  only  to  the  LlHs  moiety  of  the  clone,  an  -LI  suffix  is 
provided  {e.g. ,  593B-L1).  When  referring  only  to  the  sequence  flanking  the  LlHs  moiety,  an 
-FI  suffix  is  provided  {e.g.,  593B-F1). 

Sequences  flanking  the  hypomethylated  LlHs  elements. 

The  sequences  directly  flanking  the  5’  end  of  the  12  hypomethylated  LlHs  elements  were 
of  special  interest  to  us  since  it  is  possible  that  they  harbor  regulatory  signals  that  distinguish 
these  LlHs  elements  from  the  bulk  of  the  elements  in  the  genome.  Therefore,  we  sequenced 
and  analyzed  about  220  bp  of  each  flanker  directly  adjacent  to  the  LlHs  element.  The 
sequences  were  unremarkable.  In  general  they  were  A-l-T-rich  (average  =  61%;  range  = 
51-75%)  with  no  large  regions  of  strand  asymmetry  {e.g.,  no  polypurine  runs).  Although  the 
occasional  clone  had  a  transcriptional  regulatory  motif  {e.g.,  CAAT),  most  did  not.  It  is 
possible,  of  course,  that  uncommon  variants  of  a  common  motif  are  present  in  the  flanking 
sequences,  or  that  novel  regulatory  signals  are  present  and  not  recognizable.  We  have  begun 
to  analyze  selected  flankers  in  promoterless  and  enhancerless  expression  plasmids  to  test 
these  possibilities. 

A  GenBank  search  provided  matches  with  only  three  of  the  flanker  sequences.  893J-F1 
consists  entirely  of  an  Alu  element.  893I-F1  contains  multiple  copies  of  the  four  bp  sequence, 
TGGA.  593F-F1  consists  of  a  novel  repetitive  sequence  (GASP  repeat)  that  is  being 
characterized  by  us.  Currently  we  know  that  the  repeat  is  present  in  about  250-300  copies  in 
Old  World  monkeys  and  is  absent  in  New  World  monkeys.  In  humans,  copies  of  the 
sequence  are  located  on  all  chromosomes,  including  the  Y  chromosome. 

Chromosomal  location  of  hypomethylated  LlHs  elements. 

We  prepared  primers  specific  for  each  flanker  and  used  them  together  with  an  LlHs- 
specific  primer  to  amplify  LiTfo-flanker  fragments  in  somatic  cell  hybrid  DNAs,  each 
containing  a  specific  human  chromosome  in  a  rodent  background  (Coriell  Mapping  Panel 
#2).  Eleven  of  the  twelve  flankers  were  assigned  this  way  to  specific  chromosomes  (Table  I). 
The  11  hypomethylated  LlHs  elements  are  found  on  7  different  chromosomes,  with  two  each 
on  chromosomes  5  and  8  and  three  on  chromosome  15.  This  non-random  distribution  could 
reflect  a  non-randomness  in  the  hypomethylation  of  various  regions  of  the  human  genome 
during  tumorigenesis  [8,9]. 

One  locus,  893N,  yielded  no  PCR  product  with  any  of  the  chromosome-specific  hybrid 
DNAs,  although  DNA  isolated  from  the  human  cell  line  used  to  make  the  hybrids  did  give  a 
signal.  Thus,  the  locus  containing  893N  must  have  been  deleted  sometime  subsequent  to  the 
fusion  of  the  human  and  rodent  cells.  Other  workers  have  previously  reported  that  some 
chromosomal  loci  are  deleted  in  the  Mapping  Panel  #2  lines  [10]. 

Methylation  status  of  hypomethylated  LlHs  elements  in  different  cell  lines. 

We  were  able  to  analyze  the  methylation  status  of  9  loci  in  a  number  of  cell  lines  (Table 
II).  The  methylation  stams  of  three  loci  (593F,  8931,  893 J)  could  not  be  ascertained  since  the 
flankers  contained  sequences  that  are  multicopy  in  the  human  genome.  In  Table  II  a  minus 
sign  indicates  the  sequence  is  primarily  unmethylated  while  a  minus/plus  indicates  that  the 
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locus  was  partially  unmethylated  and  partially  methylated.  This  could  be  due  to  differential 
methylation  of  the  two  alleles,  or  to  differential  methylation  of  the  locus  in  different  cells,  or 
both.  Note  also  that  many  of  these  cells  are  aneuploid  and  the  copy  number  of  the  loeus 
being  probed  is  not  necessarily  two/cell.  Pluses  in  Table  II  indieate  that  the  loeus  is 
methylated:  a  single  plus  indieates  that  a  defined,  high  molecular  weight  band  was  present, 
while  triple  plus  indicates  that  only  a  smear  of  hybridizing  material  was  seen  at  the  very  top 
of  the  lane.  A  double  plus  indicates  an  intermediate  condition. 

The  first  conelusion  from  Table  II  is  that  certain  loci  are  hypomethylated  in  a  variety  of 
malignant  eell  lines.  For  example,  locus  593E  appears  to  be  hypomethylated  in  all  eell  lines 
except  those  derived  from  germ  cell  tumors.  Thus,  593E  may  represent  a  locus  that  is 
demethylated  late  in  embryogenesis.  593B  has  a  pattern  similar  to  that  of  593E,  but  it  is 
partially  hypomethylated  in  one  germ  cell  line  and  is  methylated  in  several  non-germ  cell 
lines.  893Y  is  methylated  in  all  lines  exeept  T47D  (breast)  and  DAOY  (medulloblastoma). 
593C  is  not  methylated  in  cell  lines  originating  from  breast  tumors,  but  is  methylated  in 
almost  every  other  eell  line. 

The  one  non-breast  cell  line  where  593C  is  also  hypomethylated  is  the  medulloblastoma 
line,  DAOY.  An  examination  of  Table  II  demonstrates  that  the  T47D  and  DAOY  lines  have 
virtually  identieal  methylation  patterns  for  all  nine  hypomethylated  loci.  In  a  separate  series 
of  experiments,  using  hypomethylated  LlHs  flankers  isolated  from  NTera2Dl  cells,  we  found 
a  clone  that  demonstrated  an  RFLP  between  the  T47D  and  DAOY  cell  line  DNAs  with  the 
restriction  enzyme  Msp\.  Thus,  the  DAOY  DNA  is  clearly  not  T47D  DNA  that  had  been 
mislabeled.  We  have  no  explanation  for  the  very  similar  hypomethylation  patterns  in  these 
two  cell  lines  derived  from  quite  different  tumors. 

The  second  conclusion  from  Table  II  is  that  two  subsets  of  hypomethylated  loci  are 
present:  one  in  somatic  cells  and  one  in  germ  cells.  With  only  a  few  exceptions,  all  of  the 
loci  that  are  hypomethylated  in  various  malignant  and  non-malignant  somatic  cells  are  fully 
methylated  in  the  germ  eells.  This  indicates  that  some  regions  of  the  genome  that  are 
demethylated  in  tumors  derived  from  somatie  cells  remain  highly  methylated  in  germ  cell 
tumors.  In  this  respeet  it  is  interesting  to  note  that  all  of  the  loei  that  are  hypomethylated  in 
T47D  cells  are  highly  methylated  in  the  NTera2Dl  cell  line.  Two  loei  (8930  and  893V)  are 
hypomethylated  in  both  2102Ep  and  PA-1,  and  one  (593B)  is  hypomethylated  only  in  PA-1. 
The  reason  for  the  complete  methylation  of  the  loci  in  NTera2Dl,  but  not  in  the  other  two 
germ  cell  lines,  is  presently  unknown.  One  possible  explanation  stems  from  the  faet  that  the 
NTera2Dl  line  is  capable  of  retinoic  acid  induced  differentiation  and  might  reflect  the  real 
embryonic  cell  more  faithfully  that  either  of  the  other  two  germ  cell  lines. 

Originally,  we  expected  to  find  loei  that  were  hypomethylated  because  they  had  been 
involved  in  amplifieation  events.  However,  we  found  only  one  such  case:  593C  exhibits  an 
approximate  10-15  fold  amplifieation  in  the  SK-BR-3  line.  593C  is  loeated  on  chromosome  8 
(Table  I).  The  long  arm  of  chromosome  8  is  known  to  be  amplified  in  a  number  of  primary 
breast  tumors  and  the  8q21-8q24  region  is  amplified  in  the  SK-BR-3  cell  line  [11]. 

Therefore,  it  seems  likely  that  583C  is  within  the  8q21-8q24  region. 

Cloning  of  full-length  hypomethylated  LlHs  elements. 

We  have  screened  a  human  genomic  library  in  phage  lambda  with  some  of  the  flanker 
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sequences  described  above.  To  date  we  have  lambda  clones  for  the  following  hypomethylated 
elements:  593C  (11  clones  of  which  10  have  been  restriction  mapped),  593D  (2  clones;  1 
mapped),  593E  (8  clones;  4  mapped),  593F  (2  clones;  0  mapped).  These  clones  will  be  used 
for  coupled  transcription/translation  assays  to  determine  if  the  associated  LlHs  elements  are 
capable  of  producing  intact,  functional  proteins. 

In  addition,  we  have  isolated  40  lambda  phage  that  contain  CASP  sequences  (see  above) 
selected  randomly  from  the  genome,  and  33  lambda  phage  with  CASP  sequences  located  on 
chromosome  9.  We  are  currently  attempting  to  determine  if  the  CASP  repeat  is  involved  in 
any  aspect  of  cellular  function. 

Conclusions 

We  have  partially  characterized  a  number  of  LlHs  elements  that  may  be  active  in  several 
breast  cancer  cell  lines.  Our  results  suggest  that  LlHs  elements  that  are  unmethylated  in 
breast  tumors  are  methylated  in  germ  cell  tumors.  This  suggests  that  specific,  non¬ 
overlapping  sets  of  elements  are  active  in  different  cancers.  We  are  currently  testing  this  by 
investigating  unmethylated  LlHs  elements  from  an  embryonal  carcinoma  cell  line  and  from  a 
medulloblastoma  cell  line.  If  there  are  cancer-specific  patterns  of  LlHs  expression,  the 
probes  we  are  developing  today  may  be  useful  as  clinical  tools  in  the  future. 

The  intent  of  the  present  work  was  to  characterize  the  LlHs  elements  that  are  differentially 
hypomethylated  in  malignant  cell  lines.  Hypomethylation  of  LlHs  elements  or  surrounding 
sequences  may  be  important  in  malignant  transformation,  especially  since  demethylation  of 
genomic  DNA  appears  to  be  an  early  event  in  tumor  progression  [8,9].  Of  an  estimated  120 
hypomethylated  LlHs  elements  per  T47D  cell,  we  have  isolated  and  partially  characterized 
twelve.  If  these  twelve  elements  are  representative,  then  they  should  provide  a  picture  of  the 
hypomethylated,  potentially  active,  LlHs  elements  in  malignant  cells. 

Our  results  suggest  that  there  are  many  LlHs  elements  that  can  he  demethylated  during 
tumorigenesis,  and  some  of  these  elements  are  demethylated  in  a  variety  of  cell  lines  from 
independent  tumors.  For  example,  the  locus  593C  is  hypomethylated  in  three  breast  cancer 
cell  lines  and  one  brain  cancer  cell  line,  but  is  methylated  in  all  other  lines  (Table  II).  Locus 
893V  is  hypomethylated  in  one  of  three  non-malignant  cell  lines  and  two  of  three  breast  cell 
lines,  two  of  three  germ  cell  lines  and  two  of  three  brain  cell  lines  (Table  II).  Thus,  our 
findings  eliminate  many  models  that  might  have  described  hypomethylated  LlHs  elements  in 
malignant  cells:  for  example,  that  each  cell  line  has  a  different  subset  of  hypomethylated 
elements. 

Why  certain  loci  are  hypomethylated  in  numerous  malignant  cell  lines  is  presently 
unknown.  These  regions  of  the  genome  may  be  demethylated  randomly,  then  held  in  a 
demethylated  state  by  selection.  Selection  could  result  from  the  presence  of  a  gene  or  LlHs 
element  in  the  hypomethylated  region  whose  transcription  is  necessary  to  maintain  the 
malignant  state.  We  note  that  LlHs  need  not  necessarily  act  as  an  insertion  mutagen  in  order 
to  modify  the  cellular  phenotype.  The  element  encodes  at  least  two  proteins  whose  functions 
(with  the  exception  of  the  reverse  transcriptase  encoded  by  ORF2)  are  largely  unknown.  The 
p40  protein,  the  reverse  transcriptase,  or  possibly  other  polypeptides  originating  from  ORF2, 
could  be  involved  in  the  initiation  or  maintenance  of  the  malignant  state,  either  alone  or  in 
combination  with  other  gene  products. 
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Notes  to  Table  I. 


LlHs  (bp):  size  of  the  LlHs  moiety  in  clone. 

Flank  (bp):  size  of  the  DNA  flanking  the  5’  end  of  the  LlHs  element. 
Copy  no.:  estimated  copy  number  of  flank  DNA. 

*:  Hpall  site  at  position  36  is  CCGG,  but  not  cut  by  enzyme. 

?:  could  not  be  determined. 


Notes  to  Table  II. 

Type:  normal  is  non-malignant  breast  epithelial  (HBL-100,  MCFIOA)  or  fibroblast 
(GM0327);  breast  is  adenocarcinoma  (MCF7,  SK-BR-3)  or  infiltrating  ductal  carcinoma  (T- 
47D);  brain  is  medulloblastoma  (DAOY,  TE671)  or  glioblastoma  (A172);  germ  cell  is 
testicular  embryonal  carcinoma  (2102Ep,  NTera2Dl)  or  ovarian  embryonal  carcinoma  (PA- 
1). 

Scoring  system  (-,  -/+,  etc.)  is  given  in  text. 

?:  could  not  be  determined. 

XX :  locus  rearranged  in  this  cell  line. 
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Table  I.  Location  and  Size  of  Clones 


clone 

chromosome 

LIHs  (bp) 

flank  (bp) 

copy  no. 

593B 

5 

103 

1200 

single 

593C 

8 

36 

500 

single 

593D 

4 

103* 

500 

single 

593E 

8 

36 

700 

single 

593F 

9 

36 

400 

250-300 

8931 

15 

103* 

350 

multicopy 

893J 

15 

103 

350 

multicopy 

893N 

? 

36 

200 

single 

8930 

7 

36 

400 

single 

893V 

15 

36 

450 

single 

893W 

5 

36 

350 

single 

893Y 

3 

103 

150 

single 
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Table  II.  Methylation  Status  of  Clones 
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